Overview

Dataset Statistics

Number of Variables 26
Number of Rows 201
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 196.8 KB
Average Row Size in Memory 1002.5 B
Variable Types
  • Categorical: 16
  • Numerical: 10

Dataset Insights

width is skewed Skewed
engine-size is skewed Skewed
compression-ratio is skewed Skewed
city-mpg is skewed Skewed
normalized-losses has a high cardinality: 52 distinct values High Cardinality
horsepower has a high cardinality: 59 distinct values High Cardinality
drive-wheels has constant length 3 Constant Length

Variables


symboling

categorical

Approximate Distinct Count 6
Approximate Unique (%) 3.0%
Missing 0
Missing (%) 0.0%
Memory Size 13291

Length

Mean 1.1244
Standard Deviation 0.3308
Median 1
Minimum 1
Maximum 2

Sample

1st row 3
2nd row 3
3rd row 1
4th row 2
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 25
Decimal Number 201
  • The top 2 categories (0, 1) take over 50.0%

normalized-losses

categorical

Approximate Distinct Count 52
Approximate Unique (%) 25.9%
Missing 0
Missing (%) 0.0%
Memory Size 13544
  • The largest value (?) is over 3.36 times larger than the second largest value (161)

Length

Mean 2.3831
Standard Deviation 0.7794
Median 3
Minimum 1
Maximum 3

Sample

1st row ?
2nd row ?
3rd row ?
4th row 164
5th row 164

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 442

make

categorical

Approximate Distinct Count 22
Approximate Unique (%) 10.9%
Missing 0
Missing (%) 0.0%
Memory Size 14372
  • The largest value (toyota) is over 1.78 times larger than the second largest value (nissan)

Length

Mean 6.5025
Standard Deviation 2.2916
Median 6
Minimum 3
Maximum 13

Sample

1st row alfa-romero
2nd row alfa-romero
3rd row alfa-romero
4th row audi
5th row audi

Letter

Count 1296
Lowercase Letter 1296
Space Separator 0
Uppercase Letter 0
Dash Punctuation 11
Decimal Number 0
  • The largest value (toyota) is over 1.78 times larger than the second largest value (nissan)

fuel-type

categorical

Approximate Distinct Count 2
Approximate Unique (%) 1.0%
Missing 0
Missing (%) 0.0%
Memory Size 13728
  • The largest value (gas) is over 9.05 times larger than the second largest value (diesel)

Length

Mean 3.2985
Standard Deviation 0.9002
Median 3
Minimum 3
Maximum 6

Sample

1st row gas
2nd row gas
3rd row gas
4th row gas
5th row gas

Letter

Count 663
Lowercase Letter 663
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (gas, diesel) take over 50.0%
  • The largest value (gas) is over 9.05 times larger than the second largest value (diesel)

aspiration

categorical

Approximate Distinct Count 2
Approximate Unique (%) 1.0%
Missing 0
Missing (%) 0.0%
Memory Size 13740
  • The largest value (std) is over 4.58 times larger than the second largest value (turbo)

Length

Mean 3.3582
Standard Deviation 0.7688
Median 3
Minimum 3
Maximum 5

Sample

1st row std
2nd row std
3rd row std
4th row std
5th row std

Letter

Count 675
Lowercase Letter 675
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (std, turbo) take over 50.0%
  • The largest value (std) is over 4.58 times larger than the second largest value (turbo)

num-of-doors

categorical

Approximate Distinct Count 3
Approximate Unique (%) 1.5%
Missing 0
Missing (%) 0.0%
Memory Size 13777

Length

Mean 3.5423
Standard Deviation 0.5563
Median 4
Minimum 1
Maximum 4

Sample

1st row two
2nd row two
3rd row two
4th row four
5th row four

Letter

Count 710
Lowercase Letter 710
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (four, two) take over 50.0%

body-style

categorical

Approximate Distinct Count 5
Approximate Unique (%) 2.5%
Missing 0
Missing (%) 0.0%
Memory Size 14394

Length

Mean 6.6119
Standard Deviation 2.0171
Median 5
Minimum 5
Maximum 11

Sample

1st row convertible
2nd row convertible
3rd row hatchback
4th row sedan
5th row sedan

Letter

Count 1329
Lowercase Letter 1329
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (sedan, hatchback) take over 50.0%

drive-wheels

categorical

Approximate Distinct Count 3
Approximate Unique (%) 1.5%
Missing 0
Missing (%) 0.0%
Memory Size 13668
  • The largest value (fwd) is over 1.57 times larger than the second largest value (rwd)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row rwd
2nd row rwd
3rd row rwd
4th row fwd
5th row 4wd

Letter

Count 595
Lowercase Letter 595
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 8
  • The top 2 categories (fwd, rwd) take over 50.0%
  • The largest value (fwd) is over 1.57 times larger than the second largest value (rwd)
  • drive-wheels has words of constant length

engine-location

categorical

Approximate Distinct Count 2
Approximate Unique (%) 1.0%
Missing 0
Missing (%) 0.0%
Memory Size 14067
  • The largest value (front) is over 66.0 times larger than the second largest value (rear)

Length

Mean 4.9851
Standard Deviation 0.1216
Median 5
Minimum 4
Maximum 5

Sample

1st row front
2nd row front
3rd row front
4th row front
5th row front

Letter

Count 1002
Lowercase Letter 1002
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (front, rear) take over 50.0%
  • The largest value (front) is over 66.0 times larger than the second largest value (rear)

wheel-base

numerical

Approximate Distinct Count 52
Approximate Unique (%) 25.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 98.797
Minimum 86.6
Maximum 120.9
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • wheel-base is skewed right (γ1 = 1.0235)

Quantile Statistics

Minimum 86.6
5-th Percentile 93
Q1 94.5
Median 97
Q3 102.4
95-th Percentile 110
Maximum 120.9
Range 34.3
IQR 7.9

Descriptive Statistics

Mean 98.797
Standard Deviation 6.0664
Variance 36.8008
Sum 19858.2
Skewness 1.0235
Kurtosis 0.8953
Coefficient of Variation 0.0614
  • wheel-base is not normally distributed (p-value 2.6542376663706646e-06)
  • wheel-base has 3 outliers

length

numerical

Approximate Distinct Count 73
Approximate Unique (%) 36.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 174.201
Minimum 141.1
Maximum 208.1
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • length is skewed right (γ1 = 0.1533)

Quantile Statistics

Minimum 141.1
5-th Percentile 157.3
Q1 166.8
Median 173.2
Q3 183.5
95-th Percentile 197
Maximum 208.1
Range 67
IQR 16.7

Descriptive Statistics

Mean 174.201
Standard Deviation 12.3222
Variance 151.836
Sum 35014.4
Skewness 0.1533
Kurtosis -0.09328
Coefficient of Variation 0.07074
  • length is not normally distributed (p-value 0.003764062340043798)
  • length has 1 outliers

width

numerical

Approximate Distinct Count 43
Approximate Unique (%) 21.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 65.8891
Minimum 60.3
Maximum 72
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • width is skewed right (γ1 = 0.8685)

Quantile Statistics

Minimum 60.3
5-th Percentile 63.6
Q1 64.1
Median 65.5
Q3 66.6
95-th Percentile 70.3
Maximum 72
Range 11.7
IQR 2.5

Descriptive Statistics

Mean 65.8891
Standard Deviation 2.1015
Variance 4.4162
Sum 13243.7
Skewness 0.8685
Kurtosis 0.6322
Coefficient of Variation 0.03189
  • width is not normally distributed (p-value 6.730016473629004e-11)
  • width has 11 outliers

height

numerical

Approximate Distinct Count 49
Approximate Unique (%) 24.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 53.7667
Minimum 47.8
Maximum 59.8
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • height is skewed right (γ1 = 0.029)

Quantile Statistics

Minimum 47.8
5-th Percentile 49.7
Q1 52
Median 54.1
Q3 55.5
95-th Percentile 57.5
Maximum 59.8
Range 12
IQR 3.5

Descriptive Statistics

Mean 53.7667
Standard Deviation 2.4478
Variance 5.9918
Sum 10807.1
Skewness 0.02896
Kurtosis -0.4519
Coefficient of Variation 0.04553
  • height is not normally distributed (p-value 9.493234867082433e-07)

curb-weight

numerical

Approximate Distinct Count 169
Approximate Unique (%) 84.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 2555.6667
Minimum 1488
Maximum 4066
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • curb-weight is skewed right (γ1 = 0.7005)

Quantile Statistics

Minimum 1488
5-th Percentile 1905
Q1 2169
Median 2414
Q3 2926
95-th Percentile 3505
Maximum 4066
Range 2578
IQR 757

Descriptive Statistics

Mean 2555.6667
Standard Deviation 517.2967
Variance 267595.9033
Sum 513689
Skewness 0.7005
Kurtosis 0.00435
Coefficient of Variation 0.2024
  • curb-weight is not normally distributed (p-value 7.080488478064016e-05)
  • curb-weight has 2 outliers

engine-type

categorical

Approximate Distinct Count 6
Approximate Unique (%) 3.0%
Missing 0
Missing (%) 0.0%
Memory Size 13692
  • The largest value (ohc) is over 9.67 times larger than the second largest value (ohcf)

Length

Mean 3.1194
Standard Deviation 0.7111
Median 3
Minimum 1
Maximum 5

Sample

1st row dohc
2nd row dohc
3rd row ohcv
4th row ohc
5th row ohc

Letter

Count 627
Lowercase Letter 627
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (ohc, ohcf) take over 50.0%
  • The largest value (ohc) is over 9.67 times larger than the second largest value (ohcf)

num-of-cylinders

categorical

Approximate Distinct Count 7
Approximate Unique (%) 3.5%
Missing 0
Missing (%) 0.0%
Memory Size 13848
  • The largest value (four) is over 6.54 times larger than the second largest value (six)

Length

Mean 3.8955
Standard Deviation 0.4172
Median 4
Minimum 3
Maximum 6

Sample

1st row four
2nd row four
3rd row six
4th row four
5th row five

Letter

Count 783
Lowercase Letter 783
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (four, six) take over 50.0%
  • The largest value (four) is over 6.54 times larger than the second largest value (six)

engine-size

numerical

Approximate Distinct Count 43
Approximate Unique (%) 21.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 126.8756
Minimum 61
Maximum 326
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • engine-size is skewed right (γ1 = 1.9643)

Quantile Statistics

Minimum 61
5-th Percentile 90
Q1 98
Median 120
Q3 141
95-th Percentile 194
Maximum 326
Range 265
IQR 43

Descriptive Statistics

Mean 126.8756
Standard Deviation 41.5468
Variance 1726.1395
Sum 25502
Skewness 1.9643
Kurtosis 5.332
Coefficient of Variation 0.3275
  • engine-size is not normally distributed (p-value 1.5735572194784251e-09)
  • engine-size has 10 outliers

fuel-system

categorical

Approximate Distinct Count 8
Approximate Unique (%) 4.0%
Missing 0
Missing (%) 0.0%
Memory Size 13848

Length

Mean 3.8955
Standard Deviation 0.3066
Median 4
Minimum 3
Maximum 4

Sample

1st row mpfi
2nd row mpfi
3rd row mpfi
4th row mpfi
5th row mpfi

Letter

Count 705
Lowercase Letter 705
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 78
  • The top 2 categories (mpfi, 2bbl) take over 50.0%

bore

categorical

Approximate Distinct Count 39
Approximate Unique (%) 19.4%
Missing 0
Missing (%) 0.0%
Memory Size 13857

Length

Mean 3.9403
Standard Deviation 0.42
Median 4
Minimum 1
Maximum 4

Sample

1st row 3.47
2nd row 3.47
3rd row 2.68
4th row 3.19
5th row 3.19

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 591

stroke

categorical

Approximate Distinct Count 37
Approximate Unique (%) 18.4%
Missing 0
Missing (%) 0.0%
Memory Size 13857

Length

Mean 3.9403
Standard Deviation 0.42
Median 4
Minimum 1
Maximum 4

Sample

1st row 2.68
2nd row 2.68
3rd row 3.47
4th row 3.40
5th row 3.40

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 591

compression-ratio

numerical

Approximate Distinct Count 32
Approximate Unique (%) 15.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 10.1643
Minimum 7
Maximum 23
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • compression-ratio is skewed right (γ1 = 2.5651)

Quantile Statistics

Minimum 7
5-th Percentile 7.5
Q1 8.6
Median 9
Q3 9.4
95-th Percentile 21.9
Maximum 23
Range 16
IQR 0.8

Descriptive Statistics

Mean 10.1643
Standard Deviation 4.005
Variance 16.0397
Sum 2043.02
Skewness 2.5651
Kurtosis 4.914
Coefficient of Variation 0.394
  • compression-ratio is not normally distributed (p-value 1.454053492315302e-15)
  • compression-ratio has 27 outliers

horsepower

categorical

Approximate Distinct Count 59
Approximate Unique (%) 29.3%
Missing 0
Missing (%) 0.0%
Memory Size 13557
  • The largest value (68) is over 1.9 times larger than the second largest value (69)

Length

Mean 2.4478
Standard Deviation 0.5182
Median 2
Minimum 1
Maximum 3

Sample

1st row 111
2nd row 111
3rd row 154
4th row 102
5th row 115

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 490
  • The largest value (68) is over 1.9 times larger than the second largest value (69)

peak-rpm

categorical

Approximate Distinct Count 23
Approximate Unique (%) 11.4%
Missing 0
Missing (%) 0.0%
Memory Size 13863

Length

Mean 3.9701
Standard Deviation 0.2985
Median 4
Minimum 1
Maximum 4

Sample

1st row 5000
2nd row 5000
3rd row 5000
4th row 5500
5th row 5500

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 796

city-mpg

numerical

Approximate Distinct Count 29
Approximate Unique (%) 14.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 25.1791
Minimum 13
Maximum 49
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • city-mpg is skewed right (γ1 = 0.6753)

Quantile Statistics

Minimum 13
5-th Percentile 16
Q1 19
Median 24
Q3 30
95-th Percentile 37
Maximum 49
Range 36
IQR 11

Descriptive Statistics

Mean 25.1791
Standard Deviation 6.4232
Variance 41.2578
Sum 5061
Skewness 0.6753
Kurtosis 0.7056
Coefficient of Variation 0.2551
  • city-mpg is not normally distributed (p-value 8.62938841009031e-09)
  • city-mpg has 2 outliers

highway-mpg

numerical

Approximate Distinct Count 30
Approximate Unique (%) 14.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 30.6866
Minimum 16
Maximum 54
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • highway-mpg is skewed right (γ1 = 0.5454)

Quantile Statistics

Minimum 16
5-th Percentile 22
Q1 25
Median 30
Q3 34
95-th Percentile 42
Maximum 54
Range 38
IQR 9

Descriptive Statistics

Mean 30.6866
Standard Deviation 6.8151
Variance 46.4463
Sum 6168
Skewness 0.5454
Kurtosis 0.5176
Coefficient of Variation 0.2221
  • highway-mpg is not normally distributed (p-value 0.0007288034723078299)
  • highway-mpg has 3 outliers

price

numerical

Approximate Distinct Count 186
Approximate Unique (%) 92.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3216
Mean 13207.1294
Minimum 5118
Maximum 45400
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • price is skewed right (γ1 = 1.7961)

Quantile Statistics

Minimum 5118
5-th Percentile 6189
Q1 7775
Median 10295
Q3 16500
95-th Percentile 32528
Maximum 45400
Range 40282
IQR 8725

Descriptive Statistics

Mean 13207.1294
Standard Deviation 7947.0663
Variance 6.3156e+07
Sum 2.6546e+06
Skewness 1.7961
Kurtosis 3.122
Coefficient of Variation 0.6017
  • price is not normally distributed (p-value 6.035075047031222e-06)
  • price has 14 outliers

Interactions

Correlations

Missing Values